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Abstract. In a paper of Chaudhuri and Dasgupta published in 2006, a certain 
stochastic model for "replicating character strings" (such as in DNA sequences) was stud- 
ied. In their model, a random "input" sequence was subjected to random mutations, 
insertions, and deletions, resulting in a random "output" sequence. In this note, their 
model will be set up in a slightly different way, in an effort to facilitate further develop- 
ment of the theory for their model. In their 2006 paper, Chaudhuri and Dasgupta showed 
that under certain conditions, strict stationarity of the "input" sequence would be pre- 
served by the "output" sequence, and they proved a similar "preservation" result for the 
property of strong mixing with exponential mixing rate. In our setup, we shall in spirit 
slightly extend their "preservation of stationarity" result, and also prove a "preservation" 
result for the property of absolute regularity with summable mixing rate. 
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1. Introduction 



Chaudhuri and Dasgupta [6] formulated and studied a certain stochastic model for 
"replicating character strings" . In that paper, they cited numerous other references where 
other related models had been studied, and in particular, they cited the book by Waterman 
[17] for the possible application of central limit theory under strong mixing conditions in 
the use of such models for the statistical analysis of data from biology (e.g. involving DNA 
sequences). In this note, we shall contribute further results and techniques to the theory 
for the particular model in [6], and suggest a way of setting up their model that may allow 
slightly easier handling of certain technical details. 

Let N (resp. Z) denote the set of all positive integers (resp. the set of all integers). 

The model studied by Chaudhuri and Dasgupta [6] can be briefly described as follows: 
It starts with an "input" sequence X := (Xk, k e N) of random variables taking their 
values in some finite "alphabet" — for example the set {A, C, G, T} of letters that represent 
the nucleotides in a DNA sequence. There is another sequence Z := (Z^, k G N) of random 
variables taking their values in the set {M, I, D} — to indicate that at a given "time" (or 
"location") fc, there should be a "mutation" (M), "insertion" (I), or "deletion" (D). (This 
sequence Z is informally referred to below as the "MID-sequence" .) Probabilities are 
assigned for what letter of the "alphabet" is inserted when an insertion occurs, or what 
letter of the alphabet results from a mutation. (The — perhaps high — probability of "no 
mutation" is formally represented in this scheme as the probability of "replacing a letter 
by itself" when a "mutation" occurs.) At the end, the result is an "output" sequence 
Y := (Yfc, k e N) of random variables, with the same "alphabet" (e.g. {A, C, G, T}) as the 
"input" sequence X. 

In their paper, Chaudhuri and Dasgupta [6, Theorems 3.1 and 3.2] established certain 
conditions under which certain properties of the "input" sequence — specifically, strict 
stationarity, and strong mixing with exponential mixing rate — would be retained by the 
"output" sequence. Chaudhuri and Dasgupta [6] set up their model using ("one-sided") 
random sequences indexed by N, as described above. In Section 3, we shall set up their 
model again, but using ("two-sided") random sequences indexed by Z. This will hopefully 
make it a little easier to handle various technical details, such as keeping track of relevant 
a-fields when estimating mixing rates. 

In the statements of their main results (though not in the initial formulation of their 
model), Chaudhuri and Dasgupta [6] dealt with the case where the MID-sequence is an 
(irreducible, aperiodic) Markov chain that is independent of the the "input" sequence. We 
shall retain that "independence" assumption, but allow the MID-sequence itself to satisfy 
a somewhat more flexible dependence assumption than a "Markov" property. 

Instead of studying the mixing rates for the "input" and "output" sequences for the 
strong mixing condition, we shall do so for the absolute regularity condition, which is 
stronger than strong mixing. That will provide an opportunity to illustrate the use of 
a particularly handy "coupling" property (due to Berbee [1]) that is possessed by the 
absolute regularity condition but not by the strong mixing condition. However, along the 
way, we shall also give information that may be relevant to the further development of the 
theory for this model under the strong mixing condition. 
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Instead of studying the case of exponential mixing rates (for strong mixing) as in [6] , 
we shall focus on a certain slower ("summable") mixing rate (for absolute regularity) that 
is natural in central limit theory for bounded random variables (under either strong mixing 
or absolute regularity). 

In the model in [6], one somewhat tricky facet of keeping track of relevant cr-fields 
was keeping track of the changes in the "clock" resulting from "deletions" . We shall adopt 
an alternative technical procedure — switching to a new probability measure based on 
conditioning on a certain event — in the hope of slightly simplifying that task. 

In their model, Chaudhuri and Dasgupta [6] assumed a finite state space (for the 
"input" and "output" sequences), as described above. That is the case of primary interest; 
but for convenience, we shall relax that assumption and allow the "input" and "output" 
sequences to consist in essence of real-valued random variables. We shall actually treat 
those random variables as taking their values in (0, oo) (think of "coding" a real number x 
by the positive number e x ), and in an intermediate stage reserve the value as a temporary 
"place holder" where an "insertion" will ultimately occur. 

The model in [6] directly involved probability mass functions for what happens when 
a "mutation" or "insertion" occurs. As a measure-theoretic convenience, we shall handle 
that in a slightly different way, using independent random variables uniformly distributed 
on the unit interval as "randomizers" . 

In making these modifications, we shall not change the actual model studied by Chaud- 
huri and Dasgupta [6] in any significant way. The modifications here only involve how their 
model is set up. Our quest here is in part to facilitate further development of the theory 
for their model. There is of course the practical question, not addressed here, of to what 
extent inaccuracy may occur when, say, a "long but finite" DNA sequence is modeled as a 
"two-sided" random sequence. 

In Section 2, some preliminary information on both the strong mixing and absolute 
regularity conditions will be given. In Section 3, the model in [6] will be spelled out with 
the modifications in the setup described above. Then in Section 4, the main result of this 
note will be stated and proved. 

2. Preliminary information on two mixing conditions 

In the development of the material in Sections 3 and and 4 below, we shall start with a 
probability space, and then switch to a new probability measure (on the same measurable 
space) obtained by conditioning on a certain key event. Accordingly, in the notations in 
definitions below, the relevant probability measure will be specified explicitly. If only one 
probability measure, say P, is specified, then the notation E(. . .) will be tacitly understood 
to mean the expected value with respect to that particular probability measure P. 

Suppose (O, J 7 ) is a measurable space. Suppose W := (Wi,i € J) is a random vari- 
able/vector or stochastic process indexed by a nonempty set I — that is, W : O — > H 1 
is a function which is measurable with respect to the cr-field J 7 on Q and the Borel a- 
field on R/. The a- field (c J 7 ) of subsets of O generated by W will be denoted a(W) or 

a{W h iel). 

Definition 2.1. Suppose (O, J 7 ) is a measurable space, and P is a probability measure 
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on (fi, J 7 ). 

For any two cr-fields A and B C J 7 , define the following two measures of dependence: 



a(.4,B; P) : 



sup |P(AnP) -P(A)P(P)|; and 



(2.1) 



AeA,BeB 



(3(A, B; P) := snp-J2Yl l p (^ n B i) " P (^*) P ( S i)l 



(2.2) 



where in (2.2) the supremum is taken over all pairs of partitions {A\, A2, . . . , Aj} and 
{£?!, B 2 , ■ ■ ■ , Bj} of O such that Ai e A for each z and Pj G £> for each j. It is easy to see 
that for any two a-fields A and B, one has that 



Suppose X := (X k , k G Z) is, with respect to P, a strictly stationary sequence of 
random variables. For each n G N define the dependence coefficients 



One trivially has that each of the sequences of numbers (ot(X, n; P), n G N) and 
(fi(X, n; P), n G N) is nonincreasing. Also, by (2.3), a(X,n;P) < (3(X,n;P) for every 
positive integer n. The sequence X is (with respect to the probability measure P) "strongly 
mixing" [14] if a(X, n; P) — > as n — > 00, and "absolute regular" [16] if /3(X, n; P) — )■ as 
n — > 00 By (2.3), absolute regularity implies strong mixing. 

To motivate the results later in this paper, we shall state a classic theorem of Ibragi- 
mov, from Ibragimov and Linnik [10, Theorem 18.5.4]. 

Theorem 2.1 (Ibragimov). Suppose that on a probability space (O, J 7 , P), X := 
(Xfc, k G Z) is a strictly stationary sequence of bounded, centered random variables such 
that J2^Li ol(X, n; P) < 00. 

Then a 2 := EXq + 2 • Y^?=i EXqX u exists in [0, 00), with this sum being absolutely 
convergent. If further a 2 > 0, then (Xi +X 2 + ■ ■ . + X n )/(n 1 / 2 a) converges in distribution 
to the iV(0, 1) law as n — > 00. 

That theorem will not be used anywhere in what follows, but it will provide the mo- 
tivation for the mathematical development in this paper. For example, in a statistical 
analysis of DNA data, one might deal with indicator functions ({0, l}-valued random vari- 
ables) marking the locations of a particular pattern of nucleotides along a DNA sequence. 
Thus if strong mixing is assumed as part of the statistical model, then it might be natural 
to apply a central limit theorem for bounded strongly mixing sequences of random vari- 
ables, such as Theorem 2.1. Now the (summable) mixing rate in Theorem 2.1 is practically 
sharp. That was shown to be true even under absolute regularity, by counterexamples of 
Davydov [7, Example 2] and the author [4]. The counterexample in the latter paper is 



a(A, B-P) < (3{A,B; P). 



(2.3) 



a(X,n;P) := a(a(X k ,k < 0),a(X k ,k > n);P); and 
0(X, n; P) := f3(<j(X k , k < 0), a(X k , k>n);P). 



(2.4) 
(2.5) 
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a strictly stationary, 3-state sequence that satisfies absolute regularity with ("not quite 
summable") mixing rate /3(X, n; P) = 0(l/n). Theorem 2.1 seems to be a natural one to 
use when either strong mixing or absolute regularity is assumed in the modeling of DNA 
sequences; and it is the summable mixing rate in that theorem that we shall focus on in 
this note. 

It is worth noting that Merlevede and Peligrad [11] proved (as a special case of the 
main result in their paper) a modified, refined version of Theorem 2.1 with the barely 
slower mixing rate a(X, n;P) = o(l/n) and an explicit extra assumption on the rate of 
growth of the variances of partial sums. 

As was mentioned in Section 1, instead of dealing with the strong mixing condition, 
we shall deal with absolute regularity. That will provide an opportunity to illustrate the 
use — in Steps 5 and 6 of the proof of Lemma 4.4 in Section 4 — of a handy "coupling" 
property (from [1]) of the absolute regularity condition. That property does not exist, at 
least in as strong a form, under just strong mixing. The next three lemmas will facilitate 
that particular application of that coupling property. 

Lemma 2.1. Suppose (O, J 7 , P) is a probability space, N is a positive integer, A n and 
B n , n G {1,2,..., N} are a-Gelds C J 7 , and the a -Gelds A n V B n , n G {1,2,..., N} are 
independent (under P). Then 

N N N 

n=l n=l n=l 

In one form or another, this has long been part of the folklore; see e.g. Pinsker [12, 
p. 73]. One reference for the particular formulation here is [5, vl, Theorem 6.2]. (That 
reference also gives the exactly analogous inequality for the dependence coefficient «(.,.).) 

The next lemma has long been part of the folklore; but a reference for it seems hard 
to find. In this lemma, the random variables X and Y are not assumed to be identically 
distributed. In this lemma, the term "Borel space" means a measurable space (S, S) that 
is bimeasurably isomorphic to the space (R, 1Z) where 1Z denotes the Borel cx-field on R. 
It is well known that R N (or R z ), accompanied by its Borel cr-field, is a Borel space. 

Lemma 2.2. Suppose (S,S) is a Borel space. Suppose (O, J 7 , P) is a probability space, 
X and Y are random variables on this space which take their values in (S,S), and A is a 
cr-field C T. Then 

\0(A, <?{X);P) - 0(A, a(Y);P) | < 2 • P(X ? Y). (2.6) 

Proof. By symmetry, it suffices to prove that f3(A,a(X); P) < f3(A,a(Y); P) + 2 • 
P(X 7^ Y). Suppose {Ai, A%, . . . , Aj} and {B>i, B>i, ■ ■ ■ , Bj} are each a partition of O, 
with Ai G A for each i and Bj G cr(X) for each j. It suffices to show that 

1 1 J 

n B i) ~ P ( A ^ P ( B j)\ < °( Y );P) + mX ± Y). (2.7) 

i=i j=i 
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By a well known measure-theoretic fact (a standard elementary generalization of [2, 
Theorem 20.1]), there exists a partition {Si, S 2 , ■ ■ ■ , Sj} of S with Sj G S for each j, such 
that Bj = {X G Sj} for each j. 

For any event A, 

J 

J2\P(An{XeS J })-P(An{YeS J })\ 
j 

< \ p ( A r\{x e Sj} n{x = y}) + P(A n {x g ^} n {x ^ y}) 
i=i 

- P(A n{Y e Sj}n{x = Y}) - p(A n {Y e Sj} n {x ^ Y}) \ 

j 

= \ p ( A r\{x e Sj} n {x ^ y}) - P(A n {Y g Sj} n{x ^ Y})\ 

< 2P(An {X ^ Y}). 

Applying that with A = Ai and then also with A = fi, one has that 

Y J Y.\ P ^ B o)- p iAi)P{Bj)\ 

i=l 3=1 

I J 

< E E n { x G s i» - n { y G 

+ E E n { y e $}) - n A i)p{Y e ^)i 
i=i j=i 

i=l j=l 

< [£m A i^{X^Y})]+2p(A,a(Y);P) 
i=i 

+ E E G 5 i) - p ( x e s i)\ 

i=l j=l 

I 

< 2P(X ^ Y) + 2p(A, a(Y); P) + E^(^) ' 2P ( X ± Y )\ 

i=i 

= 4P(X^Y) + 2p(A,a(Y);P). 
Thus (2.7) holds. That completes the proof. 

The following lemma, the final item of Section 2 here, will play a key role (in Section 
4) in the comparison of the mixing rates for the "input" and "output" sequences in the 
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model in [6]. It will be applied for absolute regularity; but it holds under strong mixing 
(as stated and proved here) as well. 

Lemma 2.3. Suppose (H(0), H(l), H(2), H(3), . . .) is a nonincreasing sequence of non- 
negative numbers such that Yl^=oH{n) < 00 • Suppose that on some probability space 
(fi, J 7 , P), X := (Xk,k G Z) is a nondegenerate, strictly stationary sequence of random 
variables taking only the values and 1, such that Xl^Li &{X,n; P) < 00. Then 

00 

EH(X 1 + X 2 + . . . + X n ) < 00. (2.8) 

n=l 

Proof. Referring to the hypothesis (of Lemma 2.3), define the number 

p ■= P(X = 1) = EX > (2.9) 

Define the constant random variable £0 := 0; and for each positive integer n, define 
the partial sum S n := X\ + X2 + . . . + X n . By the hypothesis (of Lemma 2.3), the 
sequence X is strongly mixing and hence ergodic. Hence from (2.9), one has that S n — > 00 
(monotonically) a.s. as n — > 00. For technical convenience, without loss of generality 
(redefining the random variables X^ on a P-null set if necessary), we assume that that 
happens at literally every wGft. 

For each nonnegative integer j, define the random variable 

rjj := card{n E~N : S n = j}. (2.10) 

Then for every integer J > 0, 

j 

= max{n > : S n = J}. (2.11) 

j=0 

In what follows, for any real number x, let [x] denote the greatest integer < x. Also, 
in the calculations below, by the hypothesis (of Lemma 2.3), all sums and summands 
("numerical" or random) take their values in [0, 00] := [0, 00) U {00}, and hence one can 
change the orders of summations arbitrarily. 

Recall that for any nonnegative integer- valued random variable W, EW = J2^Li P(W 
> n). For each integer J > 0, by (2.11) and the trivial inequality P(S n < J) < 1, one has 
that 

j 00 j 00 00 

E (E^) = S P (^^- n ) = S P( ^- J) " 2J/P+ £ P(Sn<J). (2.12) 

j=0 n=l j=0 n=l n=[2J/p] + l 

Let us examine the last sum in (2.12). For each integer J > and each integer 
n > 2J/p, one has that J < np/2 and hence J — np < —np/2. Hence for each integer 
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J > 0, 



P ( S ™ ^ J ) ^ P ( Sn " n ^ - n ^/ 2 ) ^ S - np| > np/2) 

n=[2J/p]+l n=[2J/p] + l n=[2J/p] + l 



< (np/2)^E(S n - np) 4 < (16//) ^ n" 4 P(S n - np) z 

n=[2J/p] + l 



(2.13) 



n=l 



Extend the definition (2.4) to include n = there. Then here for each positive integer 



n, 



n-1 



n-1 



E(S n - np) 4 < (20, 000)n • (m + l) 2 a(X, m; P) + 24n 2 [ ^ a(X, m; P) . (2.14) 



m=0 



m=0 



This is a simple direct application of [5, v2, Theorem 14.63], which in turn is a convenient 
but crude version of a much sharper and more general inequality due to Rio [13, Theoreme 
2.1]. Keep in mind that since the random variables take only the values and 1, the 
("upper-tail") quantile functions in those particular statements in both references take 
only the values and 1. Eq. (2.14) can also be obtained directly, in a sharper form, from a 
careful calculation from Ibragimov's proof in [10, Theorem 18.5.4] of Theorem 2.1 (examine 
carefully the argument for [10, Lemma 18.5.2]). 

Our next task is to use (2.14) to show that the last sum in (2.13) is finite. From 
simple calculus, let C\ be a positive number such that Yln^q n_3 — Ciq~ 2 for every 
positive integer q. By the hypothesis (of Lemma 2.3), 



n-1 



oo oo 



n=l 



m=0 



J2[ n ~ 4 -n ' $^(™ + l) 2 a(X,m;P)] = n~ 3 (m + l) 2 a(X,m; P) 

m=0n=m|l 

oo 

oo. 



m=0 



Also, trivially by the hypothesis, Y^=i 



n~ 4 ■ n 2 



i 2 



< oo. Applying 



those two inequalities to (2.14), one obtains that the last sum in (2.13) is finite. 

Accordingly, defining the finite numbers C2 '■= (16/p 4 ) Y^=i n~ 4 E(S n — np) 4 and 
C3 := (2/p) + C2, one has by (2.12) and (2.13) that for every integer J > 0, 



e(J2vj)<2J/p + c 2 < c 3 (J + i). 



(2.15) 



3=0 



Now refer to the function H in the statement of Lemma 2.3. By the hypothesis, 
H(n) 4 as n — >■ 00. Using the notation S(n) for S n in subscripts, one has the equality of 
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nonnegative random variables (possibly taking the value oo) 



h(Su) = E H U) = E H U) ■ m 

n=l J=0 {ne'N:S(n)=j} j=0 



oo oo 



= E E ^ • [^(0 - ^ + !)] = E E^ w - H ^ + !)] • »&• 

jr'=0 i=j i=0 j=0 

Hence by (2.15), 

OO OO OO 1 

E H(Sn) < E [ (0 " # (< + !)] • + 1)] = C 3 E El^CO " # (< + 1)] 

n=l i=0 i=0 j=0 

oo oo oo 

= ^3 E E^ w - ^ + !)] = ^E ^0") < OO- 

Thus (2.8) holds. That completes the proof of Lemma 2.3. 

3. The model of Chaudhuri and Dasgupta, in "two-sided" form 

In this section, we shall spell out, step by step, the "replicating character string" 
model studied by Chaudhuri and Dasgupta [6]. As explained in Section 1, essentially the 
only changes here will be in the "style": (i) the use of ("two-sided") random sequences 
indexed by Z, rather than ("one-sided") random sequences indexed by N, and (ii) the 
trivial allowing of the "alphabet" or "state space" to be (0, oo) instead of just a finite set. 
In the presentation here, the essential mathematical substance of their model will not be 
changed at all. Much of the notations below will be taken directly from their paper. For 
convenient reference, the stages in this construction will be referred to as paragraphs (PI), 
(P2), etc. 



(PI). Suppose (O, J 7 , P) is a probability space. All random variables defined below, 
will be understood to be defined on this space. 



(P2). Suppose X := {X^,k G Z) is (under P) a strictly stationary sequence of 
random variables taking their values in the open half line (0, oo). 
(This is the "input" sequence, as in the model in [6].) 

(P3). Suppose Z := (Zj~,k G Z) is (under P) a strictly stationary, ergodic sequence 
of random variables taking their values in the set {M, I,D}, with this sequence Z being 
independent of the sequence X. Assume further that P(Zo = s) > for all three elements 
s G {M, /, D}. For technical convenience, without loss of generality (by ergodicity), assume 
that for every u G O and all three elements s G {M, I, D}, Zf-(u>) = s for infinitely many 
negative integers k and infinitely many positive integers k. 
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(Again, the letters M, /, and D, stand for "mutation", "insertion", and "deletion"; 
Z is the "MID-sequence" , as in [6].) 

(P4). Define the strictly increasing sequence ( := (Q, j G Z) of integer- valued random 
variables as follows: For every w£S], 

. . . < C-2M < C-iM < CoM < < 1 < Ci(w) < C2H < Cs(w) < • • • and 



The random variables Cfc will sometimes be written ((k) for typographical convenience. 

(P5). Define the sequence X := (Xk, k G Z) of random variables (taking their values 
in the closed half line [0, 00)) as follows: For every weft, 



Wk G Z, X c(fc)(w) (o;) := X k (u); and Vj £ {aM : k G Z}, X,.(o;) := 0. (3.2) 



The state is used here only as a "temporary placeholder" for a spot where an 
"insertion" will eventually occur (in eq. (3.5) below, in the case where Yi(u) = there). 
That was the sole motivation for choosing for the original sequence X a state space, namely 
(0, 00), that does not include 0. 

(P6). Define the strictly increasing sequence £ := G Z) of integer- valued random 
variables as follows: For every u G O, 



These random variables will sometimes be written 

(P7). Define the sequence Y := (Yi,£ G Z) of random variables (taking their values 
in [0, 00)) as follows: For every 00 G O, 



(P8). (i) Let U := (Uk, k G Z) be (under P) a sequence of independent, identically 
distributed random variables, each uniformly distributed on the interval [0,1], with this 
sequence U being independent of the pair of sequences (X, Z) . 

(ii) Let g : (0, 00) x [0, 1] — > (0, 00) be a Borel function. 

(iii) Let h : [0, 1] — > (0, 00) be a Borel function. 

(iv) Define the sequence Y := (Yf, I G Z) of random variables (taking their values in 
(0, 00)) as follows: For every u G O, 



{j G Z : Zj(u) G {M, £>}} = {( k (u) : k G Z}. 



(3.1) 



. . . < £_ 2 (w) < £_iH < f (w) < < 1 < £i(w) < £ 2 (w) < £ 3 (w) < . . . and 
{j G Z : Z» G {M, /}} = {£ fc (u;) : G Z}. 



(3.3) 



WGZ, Y e (u) :=X m(u) (u). 



(3.4) 




(3.5) 
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(This sequence Y is the "output" sequence, as in [6].) 

The final two "paragraphs" below give a few more items that were not needed in the 
formulation of the "output" sequence Y but will be needed in the formulation of the main 
result (Theorem 4.1 in Section 4). 

(P9). Let P denote the probability measure on (fi, J 7 ) defined as follows: 

VP G T, P (F) := P(F\Z G {M, /}) = P(F|£ = 0). (3.6) 

(The second equality follows from paragraph (P6).) 

(P10). Define the sequence V := (14, k G Z) of ({M, 1} x N)-valued random variables 
as follows: 

VfcGZ, V k :=(Z m ^ k -Ck-i). (3.7) 

Also, define the sequence T := (Yk,k G Z) of ({M, 1} x N x (0, oo))-valued random 
variables as follows: 

va;gz, T fc :=(Vfc,y fc ) = (z €(fc) ,e fc -efc-i,n). (3.8) 

That completes the ("two-sided") presentation of the model in [6]. 

Remark 3.1. Here are several comments pertaining to the model from [6] as spelled out 
in paragraphs (P1)-(P10) above. 

(A) It is well known from renewal theory that even though the sequence Z is (under 
the original probability measure P) strictly stationary, the sequence (Z^^k G Z) is in 
general not strictly stationary under P. As a consequence, under P the "output" sequence 
Y will in general not be strictly stationary. To obtain the stationarity of Y, Chaudhuri and 
Dasgupta [6, Theorem 3.1] assumed directly that that sequence (Z^^, k G Z) (though not 
necessarily the entire sequence Z) is strictly stationary, with the original MID-sequence 
Z itself being a Markov chain with certain properties. Theorem 4.1 in Section 4 (the 
main result of this note) will employ the procedure, common in renewal theory, of formally 
switching to the new probability measure Pq i n (3-6), which under our assumptions will 
yield the stationarity of Y (and of the entire sequence T in (3.8)), with no "Markov" 
assumption. That is the role here of the probability measure Pq. 

(B) By paragraphs (P4), (P6), and (P10), one has that a((,£,,V) C a(Z). Recall 
from paragraphs (P3) and (P8) that under P, the random sequences X, Z , and U are 
independent of each other. By (3.6) and a trivial argument, those three sequences are 
independent of each other under Pq as well. Also by (3.6), the random sequences X and 
U (but in general not Z or even V) each have the same distribution under Pq as they do 
under P. In particular, 

Vn G N, (3(X, n; Pq) = (3(X, n; P) (3.9) 

(and the analogous equality holds for «(.,.)). 

(C) In paragraph (P3), it was implicitly understood in the phrase "without loss of 
generality" that on a certain "bad" event F with P(F) = 0, one might need to redefine 
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certain random variables Z k ,k G Z. By (3.6) Po(F) = as well, and hence that phrase 
"without loss of generality" applies under Pq as well as under P. 

(D) In [6] (with finite "alphabet"), the probabilities involving "mutations" and "inser- 
tions" were specified directly. In [6] it was also pointed out how the context of "mutation" 
could include, as part of the model, "high probability of no mutation". Paragraph (P8) 
just gives an alternative way to set all that up, using the independent random variables U k 
uniformly distributed on the interval [0, 1] as "randomizers" , and using appropriate choices 
of the Borel functions g (to determine "mutations") and h (to determine "insertions"). In 
particular, for a given x G (0, oo), the function g(x, u),u G [0, 1] can be defined to be equal 
to x itself ("no mutation") for u in "most" of the interval [0, 1]. 

4. The main result and its proof 

This section is devoted to the proof of the following theorem, the main result of this 
note: 

Theorem 4.1. Assume the entire context of paragraphs (P1)-(P10), with all assumptions 
there satisfied. 

(I) Under the probability measure Pq in (3.6), the sequence T (in paragraph (P10)) is 
strictly stationary (and hence under P the sequences V and Y are each strictly stationary). 

(II) If also £~ =1 /3(X,ra;P) < oo (see also (3.9)) and £~ =1 0(V, ra; P ) < oo, then 
£~ i m n; Po) < £~ i n; Po) < oo. 

Statement (I) is in spirit a slight extension of [6, Theorem 3.1], which in their setup was 
a corresponding "preservation of strict stationarity" result involving a "Markov" assump- 
tion on the MID-sequence. Statement (II) was inspired by [6, Theorem 3.2], which in their 
setup was a corresponding "preservation of mixing rate" result involving strong mixing 
with exponential mixing rate. It seems clear that the setup here in paragraphs (P1)-(P10), 
involving "two-sided" random sequences, can facilitate the proofs of such "preservation of 
mixing rates" results involving absolute regularity, such as Statement (II) here; but it is 
yet to be determined to what extent the setup here might facilitate such results involving 
strong mixing. 

We shall first prove Statement (I). The proof given below will be a somewhat modified 
version of the argument for [6, Theorem 3.1]. The argument will proceed through a series 
of lemmas. The first lemma is of a standard form. (In closely related contexts, a very 
similar fact was used in [3, proof of Lemma 5] and in [4, pp. 7-8]; see also [5, v3, Theorem 
26.4(1)].) 

Lemma 4.1. In the context of paragraphs (P1)-(P10) (with all assumptions there satis- 
fied), the sequence ((Z k , X k ), k G Z) is, under the probability measure P, strictly station- 
ary. 

Sketch of proof. Suppose j is any integer. Define the integer-valued random variable 
T := maxjfc G Z : ( k < j}. The entire array ((Z k ,k > j + 1), (X k ,k > j + 1)) can 
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be represented as cf>((Zk,k > j + 1), (Xt+i, Xt+2, Xt+3, ■ ■ ■)), where the (measurable) 
function^ : {M, I, £>} N x (0, oo) N ->• {M, /, D} N x [0, oo) N does not depend on j. Under P, 
regardless of j, by the assumptions in paragraphs (P2)-(P5) and an elementary argument, 
the sequence (Xt+i, Xt+2, Xt+3, ■ ■ •) is independent of a(Z,T) (= o~(Z)) and has the 
same distribution as the sequence (Xj., X 2 , X 3 , . . .), and Lemma 4.1 then follows easily. 

Lemma 4.2. Suppose L > 3 is an integer. Suppose that for each I G {1,2, ...,L}, 
sg G {M, /}, Ng is a positive integer, and Bg is a Borel subset of [0, oo). For each J G 
{1,2, . . . , L — 1}, define the event (see (3.7) and (3.3)) 

L 

Fj -.= [z e {M,/}}f|[f|({y_ J+ , = (sg,Ng)}n{Y.j +£ e (4.1) 

£=1 

Then P(F 1 ) = P(F 2 ) = ... = P(F L ^). 

Proof. Define the integers m := and m £ := N± + N 2 + . . . + Ng for i G {1,2,..., L}. 
These integers will sometimes be written below as m(£). Define the set S := {1, 2, . . . , 
^l} - {mi,ffl 2 ,...,m L }. 

Suppose J G {1, 2, . . . , L — 1}. 

By (3.3), {Z G {M, /}} = {£ = 0}. As a consequence, one has the equality of events 

L L 

[Z g{M, 7}}f| [f| {£_ J+e - C-j+£-i = Ng}j = f| {£_j +e = -mj + m e } 



£=0 



= [(\{Z_ m(J)+m{e) e {MJ}]f][f]{Z_ m{J)+u = D} . 

£=0 



(4.2) 



ues 



Referring to (4.1) and applying both equalities in (4.2) carefully, one obtains 

Fj={z e{M,l}}f) 

L 

[fl = s,} n {Z-j+g - Z-j+i-i = Ng} n {X ?( _ J+ , } g 54 

L L 

= [P)^- J +^ = ~ mj + m 4]Pl[n(^- m ( J )+ m W = S ^ n { X -m(J)+m(£) £ 
£=0 £=1 

r L 

= {Z_ m(j) G {M,/}}P p| ({Z_ m(J)+m(/) = sg} n {X 

n[n^ 



£=1 

-m(J)+u = D} 



ues 



(4.3) 



By Lemma 4.1, the probability (under P) of the last expression in (4.3) does not depend 
on J (G {1, 2, . . . , L - 1}). Thus Lemma 4.2 holds. 
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Lemma 4.3. The sequence ((Vg,Yi),£ G Z) of (({M,I} x N) x [0, oo))-valued random 
variables is strictly stationary under P . 

Proof. Suppose j 6 Z and n G N. It suffices to prove that under P , the "random 
vectors" ((V e ,Y e ),£ G + j + 2, . . . , j + n}) and ((V e ,Y e ),£ G {j + 2, j + 3, . . . , j + n + 1}) 
have the same distribution (on ({M, 1} x N x [0, oo)) n ). 

Let J and L be positive integers such that {J — 1, J} C {1, 2, . . . , L — 1} and {j + 
l,j + 2,...,j + n} C { — J + 1, — J + 2, . . . , — J + L}. It suffices to prove that under P , 
the "random vectors" ((Vi,Y e ),e G { — J + 1, — J + 2, . . . , — J + L}) and ((V^,F^),£ G 
{—J + 2, —J + 3, . . . , —J + L + 1}) have the same distribution. But that holds by (3.6), 
Lemma 4.2, and a trivial calculation. Thus Lemma 4.3 holds. 

Proof of Statement (I) in Theorem 4-1- By paragraph (P8) (see Remark 3.1(B)), the 
sequence U is, under P , independent of the sequence ((Vi,Ye),i G Z). It follows from 
paragraph (P8) and Lemma 4.3 (again see Remark 3.1(B)) that under P , the sequence 
((Ve, Ye, Ue), £ G Z) is strictly stationary. Now Statement (I) in Theorem 4.1 holds by (3.5) 
and (3.8). 

The proof of Statement (II) in Theorem 4.1 will be based on the following lemma. In 
what follows, Eq{. . .) denotes expected value with respect to the probability measure Pq. 
The indicator function of a given event A will be denoted 1(A). 

Lemma 4.4. In the context of (P1)-(P10) (with all assumptions there satisfied), suppose 
also that X^Li P(X, n; P) < oo. Define the sequence (H(n), n G {0} fl N) of nonnegative 
numbers as follows: For each n > 0, H(n) := /3(X, n + 1; P). 
Suppose N is an integer such that N > 2. Then 

N-l 

p(T, N ; P ) < 0(V, N; P ) + 2E H ( £ I(Z m = M)) . (4.4) 

i=i 

Proof. The proof of this lemma will proceed through a series of "steps" . 

Step 1. Refer to the integer iV > 2 in the hypothesis (of Lemma 4.4). Define the 
nonnegative integer-valued random variable T by 

T := card j/c G N : 1 < k < £iv - 1 and Z k G {M, L>}| 

= max{j G {0} U N : Q < £ N }. (4.5) 

(The second equality in (4.5) holds by (3.1).) Define the ("one-sided") sequence X* := 
[X{, X|, Xg , . . .) of random variables as follows: 

VA; > 1, Xt := X k+T . (4.6) 

Step 2. Now let us first look at the random variable Yn. Suppose u G O. If 
%A0(c)M = I, then £ N (u) £ {Ck{oo) : k G Z} by (3.1), and Y N (u) = by (3.4) and 
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(3.2). If instead Z^ N ^^(u) = M (the only other possibility, by (3.3)), then for some 
q > 1, 6vM = C«M> hence g = T(w) + 1 by (4.5), and hence Y N (u) = X^ q ^(u) = 
X q (u) = I r(w)+1 ( W ) = X*(u;) by (3.4), (3.2) and (4.6). 

Thus Y N = • I(Z m) =I)+ X* ■ I(Z m) = M). Hence by (3.7), 

a(Y N )ca(V N ,X*). (4.7) 

Step 3. Now suppose £ is any integer such that £ > N. Our task here in Step 3 is to 
obtain some sort of analog of (4.7) for Yg. 
First define the random variable 

t : = card j/c e N : £ N < k < & and Z k e {M, £>}}. (4.8) 

Then r = [ELjv = M )] + Ei=JV+i(& - Ci-i - !)]• Here in the ri S ht hand side > 

for a given w G O, by (3.3), the first sum is simply the number of indices k in the set in 
(4.8) such that Z k (uS) = M, and the second sum is simply the number of indices k in that 
set such that Z k (u) = D. (Either sum can be zero.) From that expression for r, one has 
by (3.7) that 

a(r)ca(V N ,V N+ll ...,V e ). (4.9) 

Now suppose u G O. Consider first the case where Z^^(u>) = M. Then for some 
q > 1, £ e (u) = ( q (u>); and by (4.5), (4.8), and (3.1), q = T(u) + t(u). Hence by (3.4), 
(3.2), and (4.6), Y i{u) = X ^^^(u) = X q (u) = X*^(u). Also in the case where 
Z£(e)(u>)( u ) = M, one has that r(u) > 1 by (4.8). If instead Z^^(u) = I (the only other 
possibility), then Y e (u) = by (3.4) and (3.2). 

Thus (for our given £ > N), putting all those pieces together, 

oo 

Y e = 0- I(Z m =I)+ X* ■ I(Z m = M) = + J2i X t ■ I(t = t) ■ I{Z m = M)], 

t=i 

and hence by (4.9) and (3.7), 

a(Y e ) C a(V N , V N+1 , ...,V e )V a (A*, A*, A*, ...). (4.10) 
Step 4- Combining (4.7) and (4.10), one now has that 

a(Y N ,Y N+1 ,Y N+2 , . . .) C a(V N , V N+1 , V N+2 , . . .) V a(X{, A 2 *, A 3 *, . . .). (4.11) 
Define the two random arrays A and B by 

A:= ({X k ,k<0);{V k ,k<0);{U k ,k<0)) and (4.12) 
B:= (x*;(V k ,k> N);(U k ,k> AT)). (4.13) 
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By (3.5), a(Y e ) C a(Y e , U e ) for each integer L One now has by (3.8) and (4.11) that 



a(T £ ,£>N) C a(B). 



(4.14) 



We need some sort of analog of (4.14) for a(Ti,£ < 0) and cr(A). 

For that purpose, we will need to work with the probability measure Pq (i n (3-6)). 
Some more notations will be needed. For events A and B, the notation A = B will mean 
that Po(AAB) = 0, where A denotes the symmetric difference. For an event A and a 
cr-field £>, the notation AeB will mean that A = B for some B e B. For a-fields A and 
£>, the notation .Ac£> will mean that AeB for every A e A, and the notation ^4, = B will 
mean that A<zB and BcA. 

Refer to (3.7) and both equalities in (3.6). For u G {£o = 0}, the "ordered pairs" 
(V k (u),k < 0) determine ("measurably") the set of integers {£ k (u),k < 0} as well as 
Zj{uS) (M or 7) for j in that set, and hence determine Zj(u) for all j < (since Z/(u;) = -D 
for integers j ; < that are not in that set). Combining that with (3.3), one obtains 



that <r(V k ,k < 0) =_ a(Z k , k < 0). Now a(X k ,k < 0) C a(X k , Z k , k < 0) by (3.1) 
and (3.2); hence a(Y k ,k < 0) C cr(X k , Z k ,k < 0) by (3.3) and (3.4); and hence also 
a(Y k , k < 0) C a(X k , Z k , U k , k < 0) by (3.5). Hence by (4.12) and (3.8), 



Step 5. On the original probability space (O, J 7 , P) let Y be a random variable which 
(under P) is uniformly distributed on the interval [0, 1] and is independent of o~(X, Z, U) 
(recall paragraphs (P2), (P3), and (P8)). Now of course by the hypothesis (of Lemma 4.4), 
the sequence /3(X, n; P) — > as n — > oo (absolute regularity). At this point, we shall apply 
the "coupling" result of Berbee [1, p. 104, Theorem 4.4.7 and p. 106, the Note], which is 
closely related to the "maximal coupling" result of Goldstein [9] . As a convenient reference 
for Berbee's result, we cite [5, v2, p. 277, Theorem 20.7 and pp. 477-478, Lemma A1651]. 
The latter "lemma" in that reference simply involves the use of the random variable Y 
above as a "randomizer", and it is simply (a special case of) the version in Dudley and 
Philipp [8, Lemma 2.11] of a theorem of Skorohod [15]. Thereby there exists a sequence 
X' := (X' k , k e Z) of random variables with the following properties (under P): 



By paragraphs (P1)-(P8) and the properties of the random variable T, one has that under P 
the ex-fields cr(Y), &(X), cr(Z), and a(U) are independent. By (3.6) and a trivial argument, 
that holds under Po as well. Further, by (3.6) and a trivial argument, the distribution of 
the random array (Y,X,X', U) (on [0, 1] x R z x R z x [0, l] z ) is the same under P as it 
is under P. In particular, (4. 16)- (4. 18) all hold with P replaced with P (see also (3.9)). 

Thus under Pq, the following statements hold: By (4.19) the a-fields, a(X, X'), <r{Z), 
and ct{U) are independent; hence by (4.17) the cr-fields cr(X'), a(X k ,k < 0), cr(Z), and 



k < O)Ot(A). 



(4.15) 



the distributions of the sequences X and X are identical (under P); 
the a— fields cr(X') and a(X kl k < 0) are independent (under P); 
Vn e N, P(3k > n such that X' k ^ X k ) = 0(X, n; P); and 
a(X') C a(X,Y). 



(4.16) 
(4.17) 
(4.18) 
(4.19) 



16 



cr(U) are independent; and hence the cr-field cr(X') is independent of the cr-field a(U, Z) V 
a(X k ,k< 0). 

Step 6. Now referring to (4.5), define analogously to (4.6) the ("one-sided") sequence 
X'* := (X[*, X 2 *, X3*, . . .) of random variables as follows: 

V/c > 1, X'C := X' k+T . (4.20) 

Now consider an arbitrary event A C o~(U, Z) Va(X k , k < 0) such that -Po(^4) > 0. For 
each integer t > such that Pq{A fl {T = t}) > 0, one now has (note that cr(T) C cr(Z)) 
that 

£ (x'*\A n {T = 0) = £ ((^ +1 , x f ' +2 , x t ' +3 , . . oi A n {t = t}) 

= A)0^t+1) • • •) = ^0(^1 5 -^2) -^3' • • •)) (4-21) 

where Cq(. . .) (resp. £o(- ••!•••)) denotes the distribution (resp. conditional distribution) 
under Pq. Since the last term in (4.21) is "constant" (not depending on A or £), it follows by 
a simple standard calculation that for each such event A, Cq{X'*\A) = Cq(X[, X' 2l X' 3 , . . .), 
and also (consider the case A = Q) Cq{X'*) = Co(X[, X 2 , X%, . . .). Consequently, the 
sequence X'* is (under P ) independent of the cr-field a(U, Z) V a{X k , k < 0). 
Analogously to (4.13), define the random array B' as follows: 

B' := (X'*; (Vfc, k > N); (U k , k > iV)) . (4.22) 

Referring to the last sentence of the preceding paragraph and the third sentence after 
(4.19) (which together with paragraph (P8)(i) yields (3(U,N;P ) = 0), one has by (4.12), 
(4.22), Remark 3.1(B), and Lemma 2.1 that 

(3(<j(A), cr(B'); Pq) = (3{V, N ; P ). (4.23) 

Also, by (4.5), (4.13), (4.22), and (4.18) (and the fact a(T) C a(Z)), with the sums below 
taken over all nonnegative integers t such that Pq(T = t) > 0, one has (recall the sequence 
H( . ) in the statement of Lemma 4.4) that 

Po(B'^B) = P (X'* y^X*) = J2 p o( x> * ^ X*\T = t) -P (T = t) 

= ^ -Pq ( ( x t+ 1 ■> x t+2 > x t+3> ■■■)¥' (Xt+i ,X t+ 2,X t+s , . . .)\T = t)-P (T = t) 
= ^ -Pq ( ( x t+ 1 ■> x t+2 > x t+3 > ■■■)¥' ( x t+i ,X t+ 2,X t+s , . . .))-P (T = t) 

= E H(T). 

Hence by (4.12)-(4.15), (4.23), and Lemma 2.2 (and Theorem 4.1(1), proved above) 

p(T, N; Pq) < p(a(A), a(B);P ) < (3(a(A),a(B'); P ) + E H(T) 

= P(V,N;P )+2E H(T). (4.24) 
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Now by (4.5), T > ^iLi 1 H^^(i) = M). Also, the sequence H(n) (in the statement of 
Lemma 4.4) is nonincreasing as n increases. It follows that EqH(T) < EqH{Y^~^ I(Z^ 
= M)). Combining this with (4.24) one obtains Lemma 4.4. 

Proof of Statement (II) in Theorem Define the sequence G := (I(Z^ = M),z e 
Z) of random indicator functions. Under P this sequence is strictly stationary by (3.7) 
and Theorem 4.1(1) (proved above). By (3.6), paragraph (P3), and a trivial argument, the 
sequence G is also nondegenerate under Pq. Also, by (3.7) and the hypothesis (of Theorem 
4.1(H)), one has that Y^Li n 'i Po) < 00 • Also, by the hypothesis (of Theorem 4.1(H)), 
the (nonincreasing) sequence (H(n),n G {0} n N) of nonnegative numbers in Lemma 4.4 
is summable. Hence for that sequence H(.), by Lemma 2.3, Yl^=2 
= M)) < oo. Hence by (3.8), Lemma 4.4, and the hypothesis (of Theorem 4.1(H)), the 
conclusion of Theorem 4.1(11) holds. That completes the proof. 
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